The main task of our thesis is we implemented Document Information Retrieval System, which is a search engine that is used to index and search HTML document based on Eclipse plug-in mechanism. 我们论文的主要工作是实现了DIRS(DocumentInformationRetrievalSystem)系统,DIRS系统是基于EclipsePlug-in机制的一个对HTML文档进行检索并对搜索结果进行聚类的搜索引擎。